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ABSTRACT 


Obtaining a confidence interval for a parameter A of an exponential 
distribution is a frequent occurrence in life testing problems. Often- 
times the test plan used is one in which all the observations are censored 
at the same time point to: Several pprOR mate confidence interval pro- 
cedures are available in the statistical literature; however, to the 
knowledge of the author, the performance characteristics of the various 
approximations used in these procedures have not been established 
analytically. The purpose of this paper is to report the results of an 
empirical stucy of the performance of four of these procedures with 
respect to the expected length of the interval, the variance of the 


interval length, and the coverage probability. 
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IT. INTRODUCTION 


In life testing applications, it is frequently desired to obtain a 
confidence interval for the parameter \ of an exponential distribution. 
In case a test plan is used for which all the observations are censored 
at the same time point ce several approximate confidence interval pro- 
cedures are available in the statistical literature. To the knowledge 
of the author, the goodness of the various approximations used in these 
procedures, and hence the goodness of the procedures themselves, have 
not been established analytically. The purpose of this paper is to 
report the results of an empirical study of the performances of four of 
these procedures with respect to the expected length of the interval, 
the variance of the interval length and the coverage probability. 


The general setting of the problem is as follows: suppose the random 


variables T,, T., -, L are independent and identically distributed with 
1 2 n 
common exponential distributions | 
foc.) — pe CER LO Piva tee wae ie 
i a i 
The random variables that will actually be observed are Xi Xo ee X > 


where 
ee et 
iL rh O 
BIE UE SE 5 eo at: 
These random variables are sometimes referred to as "censored" exponential 
variables, and tS is called the "censoring point". Define the random 


variables Y x 


1? *9? ; Zs by 


iy 1 Xe 
1 O 


Gore X, 2 t\3 Ac 2 ee ey ale 





It is easily seen that the random variables Yy> Yo > ee 2 are independent 


ae In Section II, 


and have Bernoulli distributions with parameter p = l- 
four confidence interval procedures for X based on the X. and Y5 are 


discussed. 





II. CONFIDENCE INTERVAL PROCEDURES 


A. PROCEDURE 1 

This procedure is obtained as a special case of a solution to a more 
general problem that has been derived by Halperin [1]. The random 
variable Y = LY has a binomial distribution with unknown parameter p. 
Suppose y is the observed value of the random variable Y, then the 100 


(1-a)% confidence bounds Py and may be obtained by solving the 


PL 


equations 


Poa | a/2 
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and 


a/2 


P {[Y > y] 


2 EP (1-p)™) 


for p. Due to the discreteness of the random variable Y, these equations 
do not yield an exact 100(1-a)% confidence interval. The confidence 
coefficient is at least 100(1l-a)%. The confidence interval for A is 
easily computed by inverting the relation p = l-e Ms to get 
Ae = -In(1-p, )/t, and hy = -In(1-p,)/t, 
A useful tool in solving for Py and Py is the Incomplete Beta 
mumetion, 
2 er 
+B+ : ~i 
F(x;0,8) = a 22 eae Sa oF aoe et Gee 
i=o 

The upper confidence bound, Pry? is obtained by solving 

F(pys yon-y-1) = l-a/2; 


and the lower bound, Py» is obtained from 


F(p,3 yon-y-1) = a/2. 





The simplicity of the computation involved in this method is very 
apparent; however, it should be noted that the observed values of the 
random variables Xy> Xo» ides xX have been disregarded. Intuitively 
it would appear that this procedure will not produce as good results 
as one which incorporates all the information of the experiment. How- 


ever, due to its simplicity, this procedure merits consideration. 


B. PROCEDURE 2 

For A es iy = LY is nearly a Poisson random variable with 

A i ee 
parameter A}2X,. Rubenstein [2] shows that = =| 1 Se eS ct 
i 2X 2n 

nearly unbiased estimator for A, where the second factor is used to 
correct for the bias of the otherwise minimum variance estimator 
Y/2K . . Wilks [3] states that for a Poisson random variable P with 
parameter 09, the following relationship is approximately true: 
o-P = Zo where Z is the standard normal random variable. Rubenstein 
applies this reasoning to the random variable Y along with his modified 
estimator 2 to obtain the relationship A -X = Z/d . this equation is 
solved for hy and he by using the appropriate standard normal percentage 
point corresponding to a 100(1-a)% confidence level. The resulting 
solutions for a and hy are 
2 


2 
A= | 2 EeCmmnCazecln zie.) 7 


and 


/2 


f Z where C = (EX, 


1/2 
2 i; 
C+ (4h2zC + Z c*) 


A 
by (Be 2 
C. PROCEDURE 3 
This procedure is a modification of one which has been suggested by 


Birnbaum [4] for application to uncensored life testing problems. Using 


the terminology commonly used in the literature of life testing to 





facilitate the explanation, imagine that the random variables Xi Xo » 
ms xX are observed sequentially. That is, a randomly selected item 
is put on test and is replaced with a similar item at failure or after 
a period of time t has elapsed, whichever occurs first. Then the 
arrival process of failures is a Poisson process. Suppose the experi- 
ment is continued until a total of k failures have been obtained. It 
is well known that the individual inter-arrival times of failures are 
exponentially distributed and the time to the a failure has a gamma 
distribution. Consider a test plan in which the experiment is stopped 
after a random amount of time (as would be the case for example, if n 
items were to be tested). The number K of failures is then a random 
variable. However, it would appear that, given K=k, the distribution 
of the time, Wy until k failures have arrived can be approximated by a 
gamma distribution. More precisely, suppose the observed value of the 


k k 


random variable K= %& Y,. is k, and let We = 2 x. Then the con- 
i=] real 


ditional distribution of Wh» given that K=k, is approximately the gamma 


distribution 


k 
f£(w) = os io a Ww > ON 





It follows that V = 2rW. can be approximated by a Chi-square variable 
2 Z 
with 2k degrees of freedom. Thus, if x l-a/2 and x a/2 are the upper 


and lower a/2 percentage points of the Chi-square distribution with 2k 


Z Z 
ye SQleei ey 


ZW. ZW 


100(1-a)% confidence interval for }). 


degrees of freedom, then constitutes an approximate 
Birnbaum also suggests an estimator for X which is merely the average 
of the upper and lower confidence bounds. This estimator has also been 


computed and tabulated. 
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D. PROCEDURE 4 


In Procedure 3 the distribution of the random variable Wee the waiting 


; th ; ; ; ; 
time until the k failure, is approximated by a gamma distribution. 
Since the test is terminated after n items are tested, the maximum value 


that Wy can attain is nt .. Consider the conditional probability 


P (Ww < w and We 1 [W, < w] 
E 


P [W. < nt] 


] 
| = O 
P{W < w/W, < nt] PTW Ss 


— 


ae 
- 
This yields a truncated gamma distribution having density function 


f (w) 


F(nt .) O < w< nt 


fe) 
£,(w)= 


O w< 0, W > nt . 
Where f(w) is the density function of the gamma distribution and 


nt 
Oo 


F(nt |) = { fiw) dw. Intuitively, this new function would seem to 

O 
approximate the probability distribution of Wy in a censored test. An 
obvious drawback of this method is the difficulty of computation since 
both X and F(nt .) which depends on X are unknown. Thus, given tables 
of the Chi-square distribution function one is forced to use an itera- 
tive method for obtaining the desired i such that 

2 2 
ea ae 2hw, | P {x < 2hw, | 


= o/2 Ew 1-0/2 
P fx’ < 2hnt | P {x < 2hnt | 


to obtain Ay and My respectively. 
The estimator for A’ described in Procedure 3 is applicable to this 


procedure and is also tabulated. 


10 


III. COMPARISON OF PROCEDURES 


A Monte Carlo study has been made to compare Procedures 1, 2, and 3 
described above. One thousand samples of size n (n = 30, 40, 50) from 
an exponential distribution with parameter A (A = .1, .2, .8, 3, 5, 10) 
have been generated. For each sample, confidence intervals for 4» have 
been obtained by using each procedure for various censoring points ty 
and confidence coefficients (l-a) (a = .05, .01). The tabulated 
quantities are the average length of the confidence interval, the sample 
variance of these lengths, and the empirical coverage probability of the 
intervals (i.e., the proportion of intervals which actually covered i). 

An abbreviated Monte Carlo study has been made to compare Procedures 
3 and 4. The lengthy computation of Procedure 4 required that the number 
of repetitions be reduced to 100 and that fewer combinations of 1, t_,n, 


O 


and q be used. 
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IV. CONCLUSIONS 


The results of this study differentiate each procedure as to its 
merits and shortcomings with respect to certain life testing situations. 
Each method is discussed below in order to define the situations in which 
it could be used. The following discussion includes only comparisons for 
Procedures 1, 2, and 3. Procedures 3 and 4 are compared separately and 
to a lesser degree due to the differences in the Monte Carlo studies made. 

Procedure 1 performed as expected; it generally gave less accurate 
results than the other procedures with respect to all three quantities-- 
expected interval length, sample variance of this length, and empirical 
coverage probability. However, since the empirical coverage probability 
of this procedure tends to be conservative and since the computation 
required for this procedure is minimal, Procedure 1 would be favored in 
cases wher. a quick but dependable confidence interval is needed or when 
a rough estimate is needed for use in more sophisticated procedures. 
Disregarding the observed value of the random variable 2X is the main 
reason for the conservative results. 

The overall performance of Procedure 2 seems to rank it first among 
those studied. However, when oe is near 1/X Procedure 3 appears to give 
comparable results with less computation. 

Procedure 3 performs very well in cases where C is approximately 
equal to 1/A; however, when a 1/X few failures tend to occur and 
the random interspersing of censored times causes the sample variance 
of the procedure to be high. In general, the empirical coverage prob- 
ability for this procedure is close to the chosen confidence coefficient. 


The actual computations needed are comparable to those for procedure l. 


12 





The estimators for A defined under Procedures 2 and 3 have been 
tabulated; the one given under Procedure 2 seems to be a nearly unbiased 
estimator for X as stated. The estimator arising from Procedure 3 is 
consistently greater than the true value of i. 

The comparison of Procedures 3 and 4 are based on an abbreviated 
Monte Carlo study. This is due to the accuracy limitations of the 
Computer when computing small values of i and also the added dependence 
upon the time till ep. failure in the iterative step of the computation. 
A more extensive study will be necessary to obtain more meaningful 
results. It appears that the interval length obtained from using Pro- 
cedure 4 is not as good as that of Procedure 3; however, the empirical 
coverage probability for Procedure 4 seems to be close to the desired 
confidence coefficient and the estimator for X appears to provide a 


nearly unbiased estimator. 
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V. COMPUTATIONAL PROCEDURES 


The IPM/System 360 Model 67 computer with the Fortran IV programming 
language has been used for all computations. 

The exponentially distributed random sample is obtained by first 
generating random numbers from a uniform distribution on the interval 
(0, 1) and then by inverting these numbers to get a random sample from 
an exponential distribution with parameter X. This is done by using the 
fact that if a random variable Z has an exponential distribution with 


distribution function 


O Z< oO 
-\Z 
F(Z) = P[Z = zj} = l-e OS Zak 
i 7 all 
the rando variable X = jee has a uniform distribution on the interval 


(0, 1). Thus, if x is a random number from the interval (0, 1), the 
number z = -1n (1-x)/\ is a random sample number from an exponential 
distribution with parameter i. The subroutine RANDU of the IBM 
Scientific Library is used to generate these random numbers. 

The second step in the computing procedure requires a simulation of 
censored testing. Therefore, each random number z is compared with the 
pre-determined censoring time to and a counter is used to obtain the 
number of failures k = LY. For Procedure 1, only the value of k is 
needed; however, Procedures 2, 3, and 4 require additional information. 
If the value of z exceeds the value of to: it is disregarded and replaced 
by t 43 these values of Ee and z which are less than t are summed to 
obtain the observed value of the random variable 2X for use in Procedure 
2. The necessity of order in Procedures 3 and 4 require the random 


numbers to be dimensioned in an array to maintain the order in which 


14 








they are generated. This array contains a random ordering of z's and 
as which are summed to the point of the pa failure after the entire 
sample of size n has been generated. This sum contains some of the 
truncated times tO but generally not all and it contains all z < to 
The value of this sum is the observed value of the random variable We 
of Procedures 3 and 4. 

The computation of Procedure 2 is trivial after the required normal 
deviate is read from a data card. However, due to the high number of 
possibilities for the value of k (0 < k < n) subroutines of the Beta 
and Chi-square distribution functions are used. These subroutines are 
written as cumulative distribution functions and give only the resulting 
probability, given the required input parameter. Since the input parameter 
XA is unknown, it has been required to write an additional function sub- 
program to iterate toward the desired parameter given the confidence 
coefficient. The restrictions on the input parameters for these sub- 
routines force the cases of k = 0 and k = n to be ignored. This is done 
by disregarding the sample which produces k and by accounting for it in the 
value of the number of repetitions. The IBM Scientific Library subroutines 
BDTR and CDTR along with the function subprograms written by the author are 
used for these computations. 

For each of the thousand samples the confidence interval length is com- 
puted and is tested to see if it actually covers \}. The final number of 
those covering X\ is divided by the number of repetitions yielding the 
empirical coverage probability. The sample variance is computed by using 
Ehe relation Pe aC: (=x) (m=) where x is the average length of the 
intervals, Ks is the length of the ae interval and m is the number of 


repetitions. 
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